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Abstract 


e1 


The  usual  assumption  in  multivariate  hypothesis  testing  is  that  the  sample  consists  of  n 
independent,  identically  distributed  Gaussian  p- vectors.  In  this  dissertation  this  assumption 
is  weakened  by  considering  a  class  of  distributions  for  which  the  vector  observation  are  not 
necessarily  Gaussian  or  independent.  This  class  consists  of  the  elliptically  symmetric  laws 
with  densities  of  the  form  f(XnXp)  =  g(tr(X  —  M)'(X  —  At)!!-1).  The  following  hypothesis 
testing  problems  are  considered:  testing  for  equality  between  the  mean  vector  and  a  specified 
vector,  lack  of  correlations  among  different  sets,  equality  of  covariance  matrices  and  mean 
vectors,  equality  between  the  correlation  coefficient  and  a  specified  number,  and  MANOVA. 
For  each  of  the  above  hypotheses,  invariant  tests  and  their  properties  are  developed.  These 
include  the  uniformly  most  powerful  test,  the  locally  most  powerful  test,  admissibility,  and 
null  and  non-null  distributions.  Further,  under  the  assumptions  that  p(-)  is  continuous,  each 
element  of  the  covariance  matrix  of  X  is  finite,  and  the  null  hypothesis  is  scalar-invariant,  it 
is  shown  that  the  usual  normal-theory  likelihood  ratio  tests  are  exactly  robust  for  the  null 
case  under  this  wider  class  (i.e.  the  likelihood  ratio  tests,  sampling  from  this  general  class, 
are  the  same  as  the  usual  normal-theory  likelihood  ratio  tests  and  their  null  distributions 
are  the  same.) 


hr 


Chapter  1 


INTRODUCTION 


The  usual  assumption  in  multivariate  hypothesis  testing  is  that  the  sample  consists 
of  independent  and  identically  distributed  Gaussian  vectors.  In  this  paper  we  consider 
the  more  general  hypothesis  testing  problem  when  the  sample  observations,  not  necessarily 
either  Gaussian  or  independent,  are  from  the  family  of  multivariate  elliptically  contoured 
distributions. 

The  elliptically  contoured  distributions  on  the  n-dimensional  Euclidean  space  38”  are 
defined  as  follows.  If  the  characteristic  function  of  an  n-dimensional  random  vector  x  has 
the  form  exp(it'/i)^(t'E*t),  where  ft  :  n  x  1,  23*  :  n  x  n,  rank(E*)  —  k,  E*  >  0,  and 
<f>  €  =  { <f>  |  ${•)  is  a  function  such  that  +  • — htf)  is  a  characteristic  function  on 

38*},  we  say  that  x  is  distributed  according  to  an  elliptically  contoured  distribution  with 
parameters  ft,  E*,  and  <f>,  and  write  x  ~  ECn(ft,'E* 

Elliptically  contoured  distributions  have  been  extended  to  the  case  of  matrices  by 
Dawid  (1977,  1978),  Chmielewski  (1980),  and  Anderson  and  Fang  (1982b). 

Let  X,  M,  and  T  be  n  x  p  matrices.  We  express  them  in  terms  of  elements,  columns, 
and  rows  as 
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X  —  (xij)  —  (xl>  •  •  •  >*p)  — 


x4o' 


,  x  =  vec  X*, 


M  =  (Mij)  =  (/*!>•■■  ,f*p)  = 


/ 

:  |,  /i  =  vecM', 


(1.1) 


(t\ 


T  —  (f|/ )  —  (*!)•••)  tp)  — 


(1) 


,  t  =  vec  T', 


u 


(«) 


where  x  =  vec  X' ^x'^j, . . .  ,xjnj^  with  the  corresponding  meanings  for /i  and  t. 
If  the  characteristic  function  of  a  random  matrix  X  has  the  form 

['  E  ‘(/W)!  *  (‘(l)s*‘(»)>  •••>  *(n)Ent(„))  , 


exp 


(1.2) 


V  /=» 


) 


with  Sj, . . .  ,E„  >  0,  we  say  that  X  is  distributed  according  to  a  multivariate  (rows)  ellip- 
tically  contoured  distribution  (MECD)  and  write  X  ~  MECnXp  (M;  Ei , . . . ,  En ;  <f>).  In  this 
paper  we  consider  only  the  subclass  of  MECD  in  which  the  function  4>  satisfies 


^(tl»-”»*n)  =  <p[tl  + - l-*n)-  (1-3) 

We  continue  to  denote  MECD  in  this  subclass  by  MEC„xp  (M;  Ei , . . . ,  E„;  <f>)  and  <p  by  <f>. 

When  /if!)  =  /i(2)  =  •  •  •  =  /i f„j  =  p  and  Ei  =  •  •  ■  =  E„  =  E,  we  write  X  ~ 
LECnXp(n,'EJ<l>),  and  if  the  density  of  X  exists  (in  this  paper,  this  is  the  usual  assumption), 
then  it  has  the  form 


|E|- »"  g  (tr  E-1G)  , 


(1.4) 
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where  G  =  E/=i  (*(/)  “  »)  (*(/)  -  (*)'■ 

Let  uM  denote  a  random  vector  which  is  uniformly  distributed  on  the  unit  sphere  in 
5R*  and  0,  (|!t||2)  denote  its  characteristic  function.  Schoenberg  (1938)  pointed  out  that  a 
characteristic  function  <f>  6  if  and  only  if 


m  =  r 

Jo 

for  some  distribution  function  F  on  [0,  oo). 


0,  (tr2)  dF(r) 


(1.5) 


Throughout  X  —  Y  means  that  the  random  matrices  X  and  Y  are  identically  dis¬ 
tributed;  rank(A)  denotes  the  rank  of  the  matrix  A. 

Cambanis,  Huang  and  Simons  (1981)  obtained  the  following  two  properties  : 


(i)  x  ~  E*,<f>)  with  rank(E*)  =  Jb  if  and  only  if 


x±p+  RD'uW,  (1.6) 

where  R  >  0  is  independent  of  u^',  E*  =  D'D  is  a  factorization  of  E*  (i.e.,  D  is  a  k  x  n 
matrix  and  rank(D)  =  k)  and  the  distribution  function  F  of  R  is  related  to  4>  as  in  (1.5) 
with  k  substituted  for  q.  For  convenience  we  denote  this  relationship  by  R  <f>  €•  $*.  Here 
x  —  y  denotes  that  the  random  vectors  x  and  y  are  identically  distributed. 

(ii)  Write  J  ,  where  u{n^  is  m-dimensional  column  vector  (1  < 

m  <  n).  Then  (u^*\  (1  -  i22in)*u("-m^,  where  fZmn(>  0),  u(m>,  and 

u(n— n»)  are  independent,  and  R^n  ~  Beta  ,  Q=M). 

Anderson  and  Fang  (1982b)  pointed  out  that  X  ~  MECnXp  (M;  Ei , . . . ,  E„;  <f>)  if  and 
only  if  x  =  vec  X'  ~  ECnp(p,  V,  <f>)  where  V  is  a  matrix  with  diagonal  blocks  E,-,  t  =  1, . . . ,  n, 
and  off-diagonal  blocks  0,  find  p  =  vec  M'.  So  it  is  easy  to  see  that  if  X  LECnxp{t*,E,<l>) 
and  rank(S)  =  l  then 

X  ^  lnp'  +  1ZUD,  (1.7) 

where  ln'  =  (1, 1,  •  •  • ,  1),  U  :  nx<,  vecU  =  u^,  D  :  £  x  p,  D'D  =  E,  and  J2  <-*•  <f>  € 
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When  £  >  0,  X  ~  LECnxp(ti,E,<i>)  with  <f>(t)  =  exp(-§t)  if  and  only  if  xyj,  j  = 

are  i.i.d.  Np(p,  £)  (This  implies  that  sampling  from  a  multivariate  normal  distribution  is  a 

special  case  of  sampling  from  the  family  of  multivariate  elliptically  contoured  distributions.) 

There  are  three  definitions  to  be  emphasized  here  : 


(i)  The  Dirichlet  Distribution  Dm(pi,  •  •  •  ,pm-i',Pm)- 


If  y  =  (pi,  •  •  • ,  ymY  is  a  random  vector  with  Vi  =  1  and  (pi,  •  •  • ,  pm_i)'  bas  the 
density  with  p,-  >  0,  *  =  1, . . . ,  m, 


fm{t  1>  l)  —  \ 


f  *?*"  (‘  -  SSI1  .  if U  >  0. 

*  =  1,  1, 


lo, 


EE1*  si; 

otherwise, 


(1.8) 


we  say  that  (pi,  ••  •  ,Pm-i)'  ~  Ai»(Pl, '  ■ '  ,Pm-i;Pn»)- 

(ii)  The  distribution  MGpjc+i( E;  •  •  • ,  ^). 

If  X  =  (Xi',  •  •  •  jXfc+i')'  /sj  LECnxp(0,  E,  4>)  with  £  >  0,  then  we  say  that  (W^j,  •  •  • , 

W(m))  ~  MGpM1(£;^,-,^;n-^;^),  where  W(0  =  X/X,-,  i  =  1 . *,  X,-,  »  = 

1, . . . ,  k  +  1,  is  a  n,-  x  p  matrix,  p  <  n,-  <  n,  *  =  1, . . . ,  Jfe,  n*+1  >  1,  and  ni  —  n- 


(iii)  Majorization  (x  -<  y) . 


For  x,  y  €  Si”,  x  -<  y  (  y  majorizes  x),  if 


where  X[,-j,  *  =  1, . 


E?=i  xi«i  ^  E*=i  vi«i>  *-!»•••»»  i; 

(1.9) 

■  E"=i  x\*[  —  E?=i  v\  i]» 

n,  is  the  arrangement  of  x,-,  *  =  1, . . . ,  n,  such  that  x^j  >  •  •  •  >  X[nj. 


In  this  paper  invariant  tests,  sampling  from  the  family  of  multivariate  elliptically  con¬ 
toured  distributions,  are  discussed.  Chapter  2  develops  some  general  results  in  finding  the 
distributions  and  power  functions  of  various  test  statistics.  Chapter  3  discusses  invariant 
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tests  in  multivariate  regression  models  and  their  properties  including  admissibility.  Chap* 
ter  4  studies  an  invariant  test  for  equality  between  the  mean  vector  and  a  specified  vector, 
shows  that  the  generalized  T2-test  is  the  locally  most  powerful  invariant  test  under  some 
regularity  conditions  and  is  also  the  only  admissible  invariant  test,  and  discusses  the  mono* 
tonicity  of  the  power  function  of  the  generalized  T2-test.  The  invariant  tests  for  testing 
various  hypotheses  and  their  properties  are  studied  in  Chapter  5.  The  following  hypothesis 
tests  are  considered  :  lack  of  correlations  among  different  sets,  equality  of  covariance  matri¬ 
ces  and  mean  vectors,  known  correlation  coefficient,  known  partial  correlation  coefficient, 
and  zero  multiple  correlation  coefficient. 

Anderson  and  Fang  (1982c)  considered  maximum  likelihood  estimation  and  the  like¬ 
lihood  ratio  test.  In  Chapter  6  we  assume  that  X  has  a  continuous  probability  density 
function,  each  element  of  its  covariance  matrix  is  finite,  and  the  null  hypothesis  is  scalar- 
invariant.  It  is  then  shown  that  the  usual  normal-theory  likelihood  ratio  tests  are  exactly 
robust  for  the  null  case  under  this  wider  class  (i.e.  the  likelihood  ratio  tests,  sampling  from 
this  general  class,  are  the  same  as  the  usual  normal-theory  likelihood  ratio  tests;  and  their 
null  distributions  are  the  same.) 

Anderson  and  Fang  (1982a)  derive  the  distribution  of  a  quadratic  form  for  the  central 
case  for  the  family  of  multivariate  elliptically  contoured  distributions.  Chapter  7  evaluates 
its  non-central  distribution. 


Chapter  2 


GENERAL  RESULTS  FOR  MULTIVARIATE 
ELLIP  TIC  ALLY  CONTOURED 
DISTRIBUTIONS 


In  this  chapter  two  general  results  for  multivariate  elliptically  contoured  distributions, 
used  in  later  chapters,  are  derived.  The  multivariate  normal  distribution  is  a  special  case 
of  a  multivariate  elliptically  contoured  distribution,  but  the  distribution  of  a  certain  class 
of  specified  functions  of  the  multivariate  normal  distribution  is  the  same  as  that  of  the 
multivariate  elliptically  contoured  distribution.  Based  on  this  fact,  if  the  test  statistic  is 
one  of  the  above  specified  functions,  the  power  of  this  test  remains  the  same  as  in  the 
multivariate  normal  case. 

2.1.  The  distributions  of  certain  specified  statistics. 

Anderson  and  Fang  (1982b,  1982c)  proved  that  the  null  distributions  of  the  likelihood 
ratio  statistics  for  testing  lack  of  correlation  between  sets  of  variates,  equality  between 
the  mean  vector  and  a  specified  vector,  equality  of  covariance  matrices,  equality  of  several 
means,  and  MANOVA  in  the  family  of  multivariate  elliptically  contoured  distributions  have 
the  same  distributions  as  in  the  multivariate  normal  case.  Chmielewski  (1980)  also  showed 
that  the  null  and  non-null  distributions  of  all  invariant  statistics  for  testing  the  sphericity 
hypothesis  are  the  same  as  in  the  multivariate  normal  case.  In  this  section  a  general  result 


is  given. 
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C. 


Theorem  2.1. 

Let  Xfixp  /w hfECnxp{M.', Ei,  ...  , Si,S2,  ...  ,  Ej,  ...  ,  Eg,  ...  ,Hq‘,<f>)  with  n,-  S,-  a, 
E,-  >  0,  tAe  ro«M  o/M  are  n,-  /ij  t  =  1,  ...  ,  q,  ]CJ=1  n,-  =  n,  and  fAe  density  o/X  exists. 

(i)  //M  =  0  and  if  the  vector  f(X)  o/  statistics  satisfies  the  condition  that  f(X)  =  f(eX), 
for  every  c  >  0,  then  the  distribution  of  f(X)  does  not  depend  on  <j>. 

(ii)  If  the  vector  f(X)  of  statistics  satifies  the  condition  that  f(X)  =  f(e(X  +  B)),  where 
the  rows  of  B  are  n,-  bj-  for  every  c  >  0  and  b,-  €  5RP,  *  =  1,  ...  ,q,  then  the  distribution 
o/f(X)  does  not  depend  on  <j>. 

Proof. 

(i)  We  can  write 

x  =  vec  X*  —  ,  (2.1) 

where  and  R  are  independent,  D'D,-  =  E,-  >  0,  and  D  is  a  matrix  with  ni  B^s,  . . . ,  ng 
Dg’s  as  diagonal  blocks  and  off-diagonal  blocks  of  0’s. 

Define  f*(vecX')  =  f(X),  then  f*(x)  =  f*(cx)  and  f*(.RD'u(np))  ^  f*(cJ2D'  u^)  for 
every  c  >  0.  Since  the  density  of  X  exists,  Pr(iJ  =  0)  =  0.  So  f*(rB'u("p))  f*(er D'u("p)), 
where  r(rD'u^"p^)  is  the  conditional  random  vector  at  R  =  r  >  0.  Then  by  letting  c  =  r-1, 
f*(rD'u<np>)  it  r(D'uM).  Since  u<np)  and  R  are  independent,  f*(x)  is  independent  of  R 
and  f(X)  is  independent  of  R.  So  the  distribution  of  f(X)  does  not  depend  on  <f>. 


(ii)  Let  B  =  — M  and  Y  =  X  -  M,  then 

f(X)  =  f(e(X  —  M)) 

w  .  (2-2) 

=  f(eY). 

When  c  =  1,  f(X)  =  f(Y).  So  it  is  true  that  f(Y)  =  f(cY),  for  any  c  >  0.  Then  from  (i),  the 
distribution  of  f(Y)  does  not  depend  on  4>.  So  the  distribution  of  f(X)  does  not  depend  on 
<f>.  I 
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Corollary  2.1. 

Let  X„xp  ~  LECnxpifi,  E,  <f>)  with  E  >  0,  and  the  density  ofX  exists. 

(i)  If  p  =  0  and  if  the  vector  f(X)  of  statistics  satisfies  the  condition  that  f(X)  =  f(cX), 
for  every  c  >  0,  then  the  distribution  of  f(X)  does  not  depend  on  <fr. 

(ii)  If  the  vector  f(X)  of  statistics  satisfies  the  condition  that  f(X)  =  f(c(X  +  lnd’)), 
for  every  c  >  0,  and  d  G  9JP,  then  the  distribution  o/f(X)  does  not  depend  on  <f>. 

Proof. 

It  is  clear  by  letting  q  =  1,  Ei  =  E,  px  =  p  in  Theorem  2.1.  I 

The  results  of  Anderson  and  Fang  (1982b,  1982c)  and  Chmielewski  (1980)  are  easily 
obtained  from  Corollary  2.1.  There  are  a  number  of  other  examples  in  Chapters  3,  4,  5, 
and  6. 

2.2.  Power  functions  of  specified  tests. 

The  previous  section  consists  of  general  results  in  distribution  theory.  In  this  section, 
we  will  attack  the  problem  of  power  functions  by  using  the  above  result. 

Theorem  2.2. 

(i)  Assume  Xnxp  ^  h£EGnxp{hd)  Ex,  ...  ,  Sj,  £2,  •  •  •  j  E2,  •  •  •  >  •  •  •  *  )  with 

n,-  E,-  S,-  >  0,  the  rows  o/M  are  n,-  p\ ’s,  i  —  1,  ...  ,q,  and  the  density  ofX  exists.  For 

testing  Ho  :  (M,  Si,  ...  ,S9)  G  flo  vs  Hi  :  (M,Si,  ...  ,S,)  G  fl  \  do,  if  the  hypotheses 
remain  invariant  under  the  group  G  =  {g  |  g  :  X  1-*  c(X  +  B),  where  the  rows  of  B  are  n,- 
b'-  ’s,  c  >  0,  b,-  G  3^,  i  =  l,  ...  ,  q},  and  f(X)  is  an  invariant  test  statistic,  then  the  power 

function  of  this  test  with  rejection  region  /(X)  G  $1  does  not  depend  on  <j>. 

(ii)  Xnxp  ~  LECnXp{p,  S,  <j>),  with  S  >  0,  and  the  density  ofX  exists.  For  testing  Ho 
:  (p,  S)  G  do  vs  Hi  :  (p,  S)  G  d  \  do,  if  the  hypotheses  remain  invariant  under  the  group 
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{g  |  j:Xh  e(X  +  Ind’),  e  >  0,  d  G  3?p},  and  /(X)  is  an  invariant  test  statistic,  then  the 
power  function  of  this  test  with  rejection  region  /(X)  G  Si  does  not  depend  on  <f>. 

Proof. 

(i)  Since  /(X)  =  /(c(X  +  B)),  by  Theorem  2.1,  the  distribution  of  /(X)  does  not 
depend  on  <t>  under  both  the  null  and  the  non-null  cases.  Therefore  the  power  function  of 
this  invariant  test  does  not  depend  on  <f>. 

(ii)  This  is  a  special  case  of  (i).  Using  this,  we  can  complete  the  proof.  S 

Corollary  2.2. 

(i)  In  Theorem  2.2  (i),  iffl  =  B  x  fi*,  fl0  =  B  x  0$,  fl  \  Q0  =  B  x  (Q*  \  flj),  where  B 
is  the  set  of  parameters  M,  fl*  is  the  set  of  parameters  (Ei,  ...  ,  E,),  and  1 lj  C  II*,  /(X) 
is  an  invariant  test  statistic  of  Ho  :  (Ei,  . . .  ,E,)  6  Hq  vs  Hi  :  (Ei,  . . .  ,Eg)  €  II*  \  QJ, 
then  the  power  function  of  this  test  with  same  rejection  region  Si  does  not  depend  on  <j>. 

(ii)  In  Theorem  2.2  (ii),  if  fl  =  B  x  fl*,  Oo  =  Bx  Q$,  fl\flo  =  Bx  (II*  \  QJ),  where 
B  is  the  set  of  parameters  p  ,  fl*  is  the  set  of  parameters  E,  and  fl|J  C  O*,  /(X)  is  an 
invariant  test  statistic  of  Ho  :  E  €  Hg  vs  Hi  :  E  €  fl*  \  OJ,  then  the  power  function  of  this 
test  with  same  rejection  region  Si  does  not  depend  on  ft. 

Proof. 

(i)  It  is  clear  that  the  hypotheses  remain  invariant  under  the  group  {g  |  g  :  X  *-*■  X-f  B 
where  the  rows  of  B  are  n,-  b'-’s,  b,-  G  3CP,  *  =  1,  . ..  , <j}.  Since  imder  the  transformation 
X  «-*■  cX,  c  >  0,  (Ei,  ...  ,E9)  does  not  change  (only  <f>  changes)  (Since  E,-,  i  =  1  ,...,q, 
are  scale  matrix,  i.e.,  for  any  a  >  0,  X  Af£(7nxp(0;Si,  ...  ,Ei,E2,  ...  ,E2,  ...  ,  S q, 
...  ,Ef;^),then  aX  ~  M£CnXp(0;Ei,  ...  ,Ei,E2,  ...  ,E2,  ...  ,E q,  ...  ,Eg;^*),  see 
Cambanis,  Huang  and  Simons  (1981).  )  So  by  Theorem  2.2,  the  power  function  of  this  test 
with  same  rejection  region  Si  does  not  depend  on  <f>. 

(ii)  Similarly,  as  in  (i),  we  can  easily  complete  the  proof. 


1 
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We  will  apply  this  result  frequently  in  the  later  chapters.  In  particular,  under  the  same 
assumptions  as  Corollary  2.2,  if  an  invariant  test  for  the  multivariate  elliptically  contoured 
distribution  is  UMPI  (uniformly  most  powerful  invariant  test)  in  the  multivariate  normal 
case,  then  we  can  claim  that  this  test  is  also  UMPI  in  the  multivariate  elliptically  contoured 
distribution. 


Chapter  3 

MULTIVARIATE  REGRESSION  ANALYSIS 


In  this  chapter,  some  properties  in  a  multivariate  regression  model  are  studied.  The 
sampling  is  from  the  family  of  multivariate  elliptically  contoured  distributions.  Invariant 
tests  are  derived  and  turn  out  to  be  the  same  as  those  dealt  with  in  the  multivariate  normal 
case.  Admissibility  is  a  desirable  property.  Admissibility  and  null  distributions  are  studied. 

3.1.  Invariant  tests  and  null  distributions. 

The  usual  assumption  in  the  multivariate  regression  model  is  that  the  sample  consists  of 
independent  and  identically  distributed  normal  vectors.  Now  we  consider,  the  more  general 
case,  the  following  multivariate  regression  model  sampling  from  the  family  of  multivariate 
elliptically  contoured  distributions  : 

fY"X,  —  Xfi  xqRqxp  +  E„xp> 


[E~  LECnXp{0,VJ), 

where  X  is  known,  q<p<n,  n  —  q>p,  rank(X)  =  q,  and  E  >  0. 

If  the  density  of  Y  exists,  then  the  density  of  E  exists  with  the  form  of 


|E|_5n  j(trE-1(E,E)). 


Hence  the  density  of  Y  has  the  form  of 


|E|"*"  $(tr  E-1(Y  -  XB)'(Y  -  XB)). 


(3.3) 
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Suppose  we  partition 

B=^j,  X  =  (X!  X2),  (3.4) 

such  that  Bi  has  q\  rows,  B2  has  <?2  rows,  Xi  has  91  columns,  and  X2  has  g2  columns.  We 
wish  to  test  JET  :  Bj  =  BJ  against  the  general  alternative  K  :  B\  ^  BJ,  where  BJ  is  a  given 
matrix. 

To  Bnd  the  maximal  invariants  of  the  sufficient  statistics,  we  need  to  find  the  sufficient 
statistics.  Since 

Y-XB  =  (Y-XBn)  +  X(Bn-B),  (3.5) 

we  have 

(Y  -  XB)'(Y  -  XB) 

=  (Y  —  XBn)'(Y  -  XBn)  +  (B„  -  B)'X'X(B„  -  B)  (3.6) 

=  N± n  +  (B«  -  B)'A(Bn  -  B), 


where  N±u  =  (Y  -  XBn)'(Y  -  XBn),  Bn  =  A_1C,  C  =  X'Y,  and 
An  Ai2 
„  A2i  A22 

From  (3.3),  the  density  of  Y  has  the  form  of 


/Au  A12\ 

=  X'X  = 

\  A2i  a22  / 


|E|-*"  g(tr  E-l[^Bn  +  (Bn  -  B)'A(Bn  -  B)]). 


(3.7) 


By  the  Factorization  theorem,  En,  Bn  form  a  sufficient  set  of  statistics  for  E,  B.  Thus, 
there  is  an  one-to-one  correspondence  between  (Bm,^)  and  (Bm,B2n)  and  En,  Bin,  and 
Bfcj  form  a  sufficient  set  of  statistics  for  E,  B.  (Note  that  B^  =  B^  +  (Bm  —  BJ)Ai2A^, 
where  B^  =  A^X^Y  -  XiBl)].) 

We  can  reformulate  the  hypothesis  as  Bi  =  0  (by  replacing  Y  by  Y  —  XjBJ)  and  the 
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problem  remains  invariant  under 

(1)Y  Y  +  x2r, 


(2)Xi  XjC*,  Bi  •-»  C*—1B1, 

where  C*  is  nonsingular, 


(3)Y  h*  YV, 

where  V  is  nonsingular. 

Under  group(l), 

En  En>  Bm  *-*  Bm> 

B2 u  *■■+  B2u»  +  r. 


(3.8) 


(3.9) 


So  the  only  invariants  of  the  sufficient  statistics  are  Eq  and  Bm  (by  letting  T  =  — B2u)). 
Under  group  (2), 


En  •-*  Sq,  Bm  *-*  C*-1Bm, 
Xj  H»XiC*, 


(3.10) 


where  Xj  =  Xi  —  X2AJ21A2i.  Let  CjAn^Cj  =  I  and  C\  be  an  orthogonal  transformation 
such  that 

cr^r^m  =  T,  (3.11) 

where  t,-„  =  0,  *  >  v,  tu  >  0,  and  An-2  =  Xj’Xj  =  An  —  Ai2A22-1A2i.  Since  T  is  a  function 
of  T'T  =  BmAn^Bm,  the  only  invariants  of  the  sufficient  statistics  are  N'Sq  (denote  as 
G)  and  (denote  as  H).  Under  group  (3), 


G  V'GV,  H  h*  VHV, 


(3.12) 
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where  V  is  nonsingular.  Hence  the  only  invariants  (i.e.  maximal  invariants)  of  the  sufficient 
statistics  are  the  roots  of 

|H  -  0G\  =  0.  (3.13) 

We  have  the  following  theorem. 


Theorem  3.1. 

The  respective  maximal  invariants  of  the  sufficient  statistics  for  the  sample  space  are 
the  roots  of  (3.13). 

Since  the  maximal  invariants  of  the  sufficient  statistics  are  same  as  the  ones  in  multi¬ 
variate  normal  case,  the  invariant  tests  are  same  as  the  ones  in  multivariate  normal  case. 
Let  $i  >  $2  >“•>  0p  be  the  roots  of  (3.13).  Some  invariant  test  criteria  are  listed  as 
follows: 


(1)  Lawley-Hotelling  trace: 

p 

trHG-1  =  Yie<- 

•=i 

(2)  Bartlett-Nanda-Pillai  trace: 


(3)  Roy’s  maximum  root: 


trH(H  +  G)-1 


9i- 


(4)  Wilks’  likelihood  ratio  criterion: 


jgj 

|G  +  H| 


= no + 

i=  1 


(3.14) 


(3.15) 


(3.16) 


(317) 


Under  the  null  hypothesis,  by  Theorem  2.1,  the  joint  distribution  of  maximal  invariants 
of  the  sufficient  statistics  does  not  depend  on  <f>.  So  the  null  distribution  of  an  invariant  test 
is  the  same  for  all  ^’s  (i.e.  it  is  same  as  the  one  in  multivariate  normal  case.) 
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Anderson  and  Fang  (1982b)  derived  that  iVEn  is  distributed  as  MGPt2(S;  2ya;  |;^) 
(see  page  4.)  Next  we  want  to  find  the  null  distribution  of  H.  Since 

Bin  =  A^bt'E,  (3.18) 


and 

E  —  i?UnxpDpxp> 


(3.19) 


H  =  BmAu^Bm 

=  E'XaW'Ah.jA^X'E  (3.20) 


=  E'FE, 


where  F  =  XA^'Ah^A^X',  D'D  =  E  >  0,  and 


If  R2  ~  Xnp  (i-e->  E  has  a  multivariate  normal  distribution),  then  H  =  E'FE  ~ 
Wp(E,  qi).  From  Cochran’s  Theorem  for  multivariate  normal  distributions,  we  have  F2  =  F 
and  rank(F)  =  qi.  Then  by  Cochran’s  Theorem  for  multivariate  elliptically  contoured 
distributions  (Anderson  and  Fang  (1982b)),  H  ~  MGP)2( E;  <f>). 


3.2.  Admissibility  of  invariant  tests. 


In  multivariate  normality,  one  way  to  approach  admissibility  is  to  apply  either  Stein’s 
Theorem  or  Schwartz’s  Theorem  (Section  8.10  of  Anderson  (1984)).  In  this  section,  we 
discuss  the  problem  of  admissibility  and  make  an  extension  of  Schwartz’s  Theorem  to  mul¬ 
tivariate  elliptically  contoured  distributions. 


Theorem  3.2. 

Assume  that  (y,m,il*,P*)  is  a  family  of  distributions  and  (]/,  m,  Q,  P)  is  a  subfamily 
of(y,  m,  IT,  P*)  such  that  for  any  bounded  function  f,  Euf(Y)  =  0  Vw  €  fl  implies  f{Y)  = 
0  o.e.  m.  Suppose  and  Qq  are  the  nonempty  proper  subsets  of  Q*  and  Q,  respectively, 
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such  that  Oo  C  OjJ  and  0  \  fio  C  0*  \  OJ.  If  the  test  with  acceptance  region  B  is  admissible 
for  testing  the  hypothesis  that  «  €  flo  against  the  alternative  u  6  Q  \  flo  ,  then  the  test 
with  acceptance  region  B  is  admissible  for  testing  the  hypothesis  that  w  G  QJ  against  the 
alternative  w  G  ft*  \  ft$. 

Proof. 

The  critical  function  of  the  test  with  acceptance  region  B  is  <f>s{y )  =  0,  y  G  B,  and 
<f>B{y)  =  1,  y  $  B.  Suppose  <i>(y)  is  the  critical  function  of  a  better  test,  that  is 


J <^(y)dp*(y)<  J <f>B{y)dp*u{y),  wen;, 


(3.21) 


/  m  dP:(y)  >  J  Mv)  dp'Av),  w  e  n*  \  n;, 

with  strict  inequality  for  some  w;  we  shall  show  that  this  assumption  leads  to  a  contradiction. 

Now  we  restrict  the  above  inequalities  to  0;  we  have 
* 

Ml/)  ~  4>b(v ))  dpu{y)  <0,  w  G  n0, 

(3.22) 

Mv)  -  f>B{y))  dpuiv)  >o,  w  g  n  \  n0. 


/« 


/< 


Since  the  test  with  acceptance  region  B  is  admissible  for  testing  the  hypothesis  that  wG(lo 
against  the  alternative  w  G  fl  \  fio, 


EU[4>(Y)  -  <f>B{Y) ]  =  0  Vw  G  0. 


(3.23) 


So  <}>{Y)  =  <}>b{Y)  a.e.  m.  This  leads  to  a  contradiction. 


Now  we  go  back  to  the  problem  of  the  multivariate  regression  model.  Since  p  >  qi, 
there  are  p  —  ?i  roots  of  (3.13)  identically  0.  It  seems  reasonable  that  if  a  set  of  roots  6  = 
(#!,  ...  ,6qiy  leads  to  acceptance,  then  a  set  of  roots  0*  =  ($1,  ...  , $*t )'  such  that  0?  <  0,-, 
*  =  1,. . .  ,qi,  should  also  lead  to  acceptance.  Such  an  acceptance  set  is  called  monotone.  A 


Chapter  S:  Multivariate  regression  analysis  17 


point  0  =  (0i,  ...  , 07l)'  is  in  the  extended  region  A*  if  the  point  0P  =  (0,-(i),  . . .  ,  0,-(9l))', 
where  *(1), . . . , i(qi)  is  a  permutation  of  1, . . . , qi  such  that  >  •  •  •  >  0,-fa,),  is  in  A.  The 
following  theorem  is  a  generalization  of  Schwartz’s  Theorem. 

Theorem  3.3. 

Let  (y,m,  ft*,  P*)  be  the  family  of  multivariate  elliptically  contoured  distributions  and 
m  be  the  Lebesgue  measure.  If  the  acceptance  region  A  in  the  set  1  >  >  *  •  •  >  0qi  >  0  it 

montone  and  if  the  extended  region  A*  is  closed  and  convex,  then  A  is  the  acceptance  region 
of  an  admissible  test. 

This  theorem  is  proved  using  the  following  corollary  from  Theorem  3.2,  and  Theorem 
3.4.1  of  Ferguson  (1967). 

Corollary  3.1. 

Let  ( y,m ,  fl",  P*)  be  the  family  of  multivariate  elliptically  contoured  distributions  and 
m  be  the  Lebesgue  measure.  If  the  acceptance  region  A  in  the  set  1  >  0X  >  •  •  •  >  0qi  >  0  is 
montone  and  if  the  extended  region  A*  is  closed  and  convex,  then  A  is  the  acceptance  region 
of  an  admissible  test  among  the  class  of  tests  based  on  the  sufficient  statistics  (En,Bn). 

Proof. 

Let  (y,m,Q,P)  be  the  family  of  multivariate  normal  distributions.  It  is  clear  that 
(So,  Bq)  is  the  complete  sufficient  statistics.  Then  this  corollary  follows  from  Theorem  3.2. 

I 


Next  let  us  state  Theorem  3.4.1  of  Ferguson  (1967). 

Theorem  3.4. 

Consider  the  game  (9,  A,  L)  where  the  statistician  observes  a  random  vector  X  whose 
distribution  depends  on  0.  If  T  is  a  sufficient  statistic  for  0,  then  the  set  Dq  of  decision 
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rules  in  D ,  which  are  based  on  T,  forms  an  essentially  complete  class  in  the  game  (9,  D, 

R). 

Note  : 

6  is  the  parameter  space;  A  is  the  space  of  actions;  L  is  a  loss  function;  D  is  the  space 
of  all  behavioral  decision  rules;  and  R  is  the  risk  function  ( see  Ferguson  (1967).) 

Proof  of  Theorem  3.3. 

For  every  fixed  density  g  (see  (3.2)),  from  Corollary  3.1,  A  is  the  acceptance  region  of 
an  admissible  test  among  the  class  of  tests  based  on  the  sufficient  statistic  (En,  Bn).  But 
(En,  Bn)  is  the  sufficient  statistic  for  (E,B).  Then,  from  Theorem  3.4,  A  is  the  acceptance 
region  of  an  admissible  test  for  this  fixed  g.  So  A  is  the  acceptance  region  of  an  admissible 
test.  I 

From  Theorem  3.3,  we  conclude  that  Wilks’  likelihood  ratio  test,  the  Lawley- Hotelling 
trace  test,  the  Bartlett-Nanda-Pillai  trace  test,  and  Roy’s  maximum  root  test  are  admissible. 


Chapter  4 


TESTING  HYPOTHESIS  FOR  MEAN  VECTOR 


In  this  chapter  an  invariant  test  for  equality  between  the  mean  vector  and  a  specified 
vector  is  derived;  other  optimum  properties  derived  in  Chapter  3  are  studied,  hi  the  mul¬ 
tivariate  normal  case,  the  generalized  T2-test  is  the  uniformly  most  powerful  invariant  test 
(UMPI),  but  in  the  multivariate  elliptically  contoured  case,  a  weaker  property  called  locally 
most  powerful  invariant  test  (LMPI)  is  derived  in  section  4.2. 

4.1.  Invariant  test,  null  distribution,  and  admissibility. 

Suppose  X„Xp  ~  LECnXp(p,  E,  <f>),  with  E  >  0,  unknown,  n>p,  and  the  density  of  X 
exists.  We  wish  to  test  H  :  p  =  p0  against  the  general  alternative  K  :  p  ^  p0,  where  /i„  is 
a  known  vector.  We  can  reformulate  the  hypothesis  as  p  —  0  (by  replacing  X  by  X  - 
and  the  problem  remains  invariant  under  the  following  group  : 

X  •-  XC,  (4.1) 


where  C  is  p  x  p  nonsingular. 

This  problem  is  a  special  case  of  multivariate  regression  model,  so  we  have  the  following 


corollaries. 
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Corollary  4.1. 

The  respective  maximal  invariant  of  the  sufficient  statistics  for  the  sample  space  is 
T2  =  n(x  -  /i0)'S-1(x  -  p0)  where  x  =  £x'l„  and  S  =  -  l„x')'(X  -  l„x'). 

Corollary  4.1  implies  that  the  T2-test  whose  acceptance  region  is  T2  <  Aj,  a  constant, 
is  an  invariant  test. 

Corollary  4.2. 

For  the  null  case,  the  distribution  of  T 2  does  not  depend  on  the  distribution  of  the 
radius  R.  That  is,  the  null  distribution  is  unique  for  any  <f>. 

Corollary  4.3. 

T2-test  is  admissible. 

4.2.  Locally  most  powerful  invariant  test. 

In  this  section,  we  want  to  derive  a  nice  result  that  the  I^-test  is  a  locally  most 
powerful  invariant  test.  First  we  need  to  find  the  non-null  distribution  of  T2. 

If  XnXp  ~  LECnXp(p,  E,<f>),n>p>  1,  and  the  density  of  X  exists,  then 

X  4  intf  +  fZUnXpDpxp,  (4.2) 

and 

x  =  -X'l„ 
n 

^/i+  -RD'U'U,  (4-3) 

n 

where  D'D  =  E,  R  ~  F  and  independent  of  U.  So 
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4  n[{p  -  p0)  +  -flD'U'ln]'[fl2D'U'(I  -  ilnlUUD]-1 
n  n 


x{(/i-/i0)  +  -i2D'U,ln] 
n 

= -«,)+ Vijwi  -  ii„i;.)u]-1 

R  ni  n 

i  ,  (4-4) 

x[^-(D-1)^-/.0)  +  -Vu’U 

R  n> 

The  last  equality  of  (4.4)  is  true  because  of  Pr{i2  =  0}  =  0. 

Let 


Y=^(D-l)'(^-^°)  +  4U'1nJ 


na 


and  Q  be  a  p  x  p  orthogonal  matrix  with  the  first  row 


(Y'Y)i 


Y*.  Define 


(4.5) 


V  =  QY 


/  (Y*Y)»  \ 


(4.6) 


V  0  ) 
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and 

B  =  Q[U,(I-ilnl^)U]Q' 

/  hi  B12 
VB21  B22 

then 

m2 

- -  4  Vb-1v 

n  —  1 

=  Y*Ybn 
_ 

ht-2  ’ 

where  6U  is  the  (l,l)th  element  of  B_1  and  bn.2  —  &n  -  Bi2B221B2i  . 
Let  Ebeanxn  orthogonal  matrix  with  nth  row  and 


then 


B  =  QU’UQ'  -  -QU'l„l'  UQ' 
n 

=  QU'(E'E)UQ'  -  iqU  i„l^UQ' 
n 
n 

=  QiYvtOQ'  -  Qunu'„Q' 

1=1 

=  Q(^u,n;)Q', 

«=1 


(4.7) 


(4.8) 


(4.9) 


(4.10) 
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and 


Y=  ~(D_1)'(M-*°)  +  u„.  (4.11) 

Prom  lemma  2  due  to  Cambanis,  Huang  and  Simona  (1981), 


(4.12) 


where  Rx  (>  0),  Ui  ((n  - 1)  x  p)  and  u*  (p  x  1)  are  independent,  and  jR*  /V  Beta(f,^). 
Thus 


B  4  (l  -  /Z^QU^Q' 

=  B(R,  i?i,Ui,u*),  (4.13) 


and 


>~p0)  +  RxyS 
=  Y(R,Rxy). 


(4.14) 


Consequently 


pr(J*  <«1-  Pr  /  V(fl.R,.u-)Y(/i,fl1,u-)  z  ) 

1  £|'  l  W*. *.,«.,<■*)  “  n  —  l/ 

,  [“"  /•‘rr[V(r,»i,u-)Y(r,..i,u-)rl  »  1 

Jo  Jo  \  hi.3(r,v>,Ui,a*)  n-lj 


(4.15) 


p  (n  -  l)p 


)dvdF(r), 
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where  /?(•  ;a,b)  is  the  density  function  of  Beta(a,6). 

Conditioned  on  u®  =  u  in  B(r,  y  »,Uj,u*),  Q  is  a  constant  orthogonal  matrix.  So 


B(r,y*,Ui,u)  =  (1  -  yjQUiUjQ' 
—  (1  —  yJU^Ui, 


(4.16) 


does  not  depend  onu.  Hence  B(r,y*,Uj,u*)  and  Y'(r,ya,u*)Y(r,  v£,u*)  are  independent, 
as  are  &n-2(r,  v»,Ui,u*)  and  Y*(r,  ya,u*)Y(r,ya,u*). 

Let 

UiUi  =  E 

(*u  ®12  \ 

.  .  -  (4-17) 

E21  E22  / 

and 


Ui  =  (uj  U2),  (4.18) 

where  uj  is  (n  —  1)  x  1,  U2  is  (n  —  1)  x  (p  —  1),  then 

K  U2)^(fl2Vi  (l-JZf)*V2),  (4.19) 

where  /?2(>  0),  vx  and  V2  are  independent,  and  fZf  ~  Beta(ayi,  ^  So 

«n-2  =  «u  —  E12E^  E21 

=  uj'ul  -  uI'U2(U^U2)"1U^uI 

±  Rfrl vr  -  fZ|v'1V2(V2V2)-V2v1  (4  20) 

=  f2|v'(I  -  V2(V2V2)-1  V2)vx . 
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Since  n  >  p  >  1,  V^V2  is  nonsingular  with  probability  one.  A  =  I  —  V^V^V^)-1  V^, 
conditioned  on  V2,  is  a  constant  symmetric  matrix,  A  =  A2  and  rank(A)  =  (n-1)— (p— 1)  = 
n—p.  By  Corollary  1  due  to  Anderson  and  Fang  (1982a),  v^Avi  ~  Beta(?iy£,  does  not 
depend  on  V2.  So  v'1(I-V2(V^V2)-1V^)v1  ~  Beta(£!=£,e=^)  and  en  2  ~  Beta(!~g, 

Then 


Pr{T2  <x}  =  J~  jf  jf  Pr  |V(r,  i>a  ,u*)Y(r,«a,u*)  <  } 


x  fi(w,  dwfi(pi  i  (l-Jk)  dv  iF{r)' 


(4.21) 


2’  2 


Let  y  =  xvo,  then 


Pr{T2  <  x}  =  jf°°  jT1  jf*  Pr  jv(r,  o*,u*)Y(r,  vi,u*)  <  j 


P(  ("  *>Pj  _n-_p  ,,  V  , 

x - — j — 7T-x  »  y  *  Ml )  a  *dy 

Wm*^) 


xP(v;Z{!l-Jk)dvdF(r). 


(4.22) 


The  density  of  T2  is 


/rW  =  l  Jo  Pr|Y'(r,»3,u*)Y(r,tf*,u*)  < 


r(^) 


r(n^£)r(ni^ll)  [  2 


[  n-p 

[  2 


~P  _£=*_,  ZL=S_>,,  t/.  "(p-» 

- a:  a  a  A(1  -  -)  a  1 


+( 


)<fodF(r) 
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„  n  ~  PtOI  —  .  n  ~  P  ,  ,  »(p-l)  „  0,._.  »~P 

X  — —  \P(w;  ~y~  +  1, - ^ - ~y~’ - 2 - '* d 


(4.23) 


We  have  the  following  theorem 


Theorem  4.1. 

//Xnxp  ~  LEC„xp(p,lD,<p),  n  >  p  >  1,  and  the  density  o/X  exists,  then  the  density 
ofT2  has  the  form  (4-23). 

Next  we  want  to  attack  the  problem  of  LMPI  by  using  Theorem  4.1.  First  let  us  define 
LMPI. 


Definition  4.1. 

A  level  a  test  ipo  is  said  to  be  locally  most  powerful  if  given  any  other  level  a  test  ip, 
there  exists  S  such  that  the  power  of  ipo  is  bigger  than  the  power  of  ip  (i.e.  P^,0{9)  >  P$(  9) 
for  all  9)  with  0  <  d{9)  <  6,  where  d(9)  is  a  measure  of  the  distance  of  9  from  Ho- 

Definition  4.2. 

A  level  a  test  ipo  is  said  to  be  locally  most  powerful  invariant  (LMPI)  if  the  test  is 
locally  most  powerful  among  the  family  of  all  invariant  tests. 

Theorem  4.2. 

Assume  X^xp  ~  LECnXp(p,  £,  <p),  n  >  p  >  1,  and  that  the  density  ofX.  exists.  For 
testing  Ho  :  p  —  Po  va  -Hi  •'  P  ^  Po>  the  T2-test  is  LMPI. 
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Proof. 

Since,  from  Theorem  4.1,  the  non-null  density  of  T2  has  the  form  (4.23), 


dJl 

dr 


But 


x  + 1.  - 1,  -  «-,•  =^,l  -  ,,24) 


..  a,_.  P  («-l)Pw_ 
x  P\v >  a’  2  ®w‘ 


V(J2,«»,u*)Y(.R,t>»,u*)  =  ~  +  -/i0)'D  V  +  w, 


(4.25) 


where  r2  =  n(/i  —  p0)'E  *(/!  -  p0).  There  exists  a  p  x  p  orthogonal  matrix  Qt  with  first  row 
"ifr*  Ma)  _ —  such  that 


Qiu  * 


“(7) 


(4.26) 


i.wf 


where  Pr{£  =  1}  =  |  =  Pr{£  =  —1},  2Tj  ~  Beta(|,  e^)  ,  and  B,  Zi  are  independent.  So 


Y{R, vi,u*)Y(B, va,u*)  -  ^  +  2^(vZi)lB  +  v, 


(4  27) 


and 
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Pr{Y'(fll»U*)Y(fl,«*,u*)  <  } 

_  f  T2  ^T,  ^  x(l  —  «)tol 

=  Pr{B?  +  2«(,,Zl),B  +  ”s-^l-} 


(4.28) 


where  0  <  v,  w  <  1. 

Since,  tinder  0  <  v,  w,  zx  <  1  and  Pr{J2  >  0}  =  1, 


fp  +2^(««i)»+v  < 


*(1  —  «)to 
n-  1 


«M^  +  («*i)’)2  +  v(l-*i)< 


x(l  —  v)w 
n-  1 


0  <  (~  +  (v*i)»)2  <  -  v(l  -  2*) 

0  <  +  (vzi)»)  <  [— — and 

A  f»  —  1 


0  <  -jj  <  [— — —  v(l  —  zi )J*  —  (w«i)»  and 
it  n  —  1 

- —  >  v(i-«i), 

n  -  1 


(4.29) 


and 
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r2  „  T  ,  U  ,  ^  *(1  “  «)« 


.x{l  -v)w  ..  >,1  .  r  ,  .1  .r*(l-»)w  .  .,1 

O  ~[  ~n  r'l  “  "f1  "  *»)!*  ~R~  (VZ^a  ~  1  —  i —  ~  W(1  “  (4.30) 

a;(l  —  v)w 

— —  a  0(1  “ •»> 

1  ,x(l-v)w  .  .,1  .  r  1  rl(l-w)tB  1 

O  {vZi) »  -  [— - — - t?(l  -  Zi)\  »  <  -  <  ( VZi )  *  +  [— — - w(l  -  Zt)\  > , 

n  —  l  it  n  —  l 

x[l  -  v)w  ^  „  x 

— — >  »(i  -  ii). 


Pr{Y'(fi,eiX)Y(fl,»S,u,)<i“p?} 

“/,  (Pr  {“  a  [!tt=a=  -  ,(1  -  *,)!*  -  („,)* } 

+  Pr  {B  a  l51^  -  •(>  -  *i)]*  +  («i)s })  +  Ar>r' 


where  A  =  >»(l-z,)},f=  {[^  -  .(I  -  z,)]i  >  («,)*}, 


(4.31) 
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and  0  <  v,  w  <  1. 

We  have,  from  (4.23), 

dfr{z) 


dr 


lr=0 


=  It,  It,  It,  (|«il=d»  -  „(l  -  *,)|i  -  (»r,)!  /(0) 

-•(»-  *01*  +  (•*)*' m) 

+  ((<-*1)5-1^ —(1 -*i)]i/(0) 

+  (»*,)* /(0))] 


I^— (» 


1  p-1,  ,  n  —  p  „  nip  —  1)  n  —  p  nip  —  1).,  , 

x  0(*i;  — j— )  dzi[p(w,  ~~  +  1, - - - 1)  -  P{w,  — — , - ^—L)]  dw 


^>i:i:i:^^a^ 


1  p  —  1.  ,  ...  n  —  p  „  n[p  —  1)  .  n  —  p  n{p  —  1).,  . 

x  0{zf, — — )  dzi[0{w,  +  1, - 2 - “  £(•»  — Y~ ’ - 2 — dw 


(4.32) 
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where  /(•)  is  the  density  function  of  12. 

*M«)|  _ 

Thus  jfojftlW  d°es  not  depend  011 density  of  R.  Since  /( 0)  is  constant  and  , under 
multivariate  normality,  the  T2- test  is  LMPI,  the  T2-test  is  LMPI  in  multivariate  elliptically 
contoured  case.  | 

Kariya  (1981a,  1981b)  has  derived  that  T2-test  is  UMPI  under  a  subfamily  of  multi¬ 
variate  elliptically  contoured  distribution  ( g  (see  (3.2))  is  nonincreasing  convex  function.) 
For  the  more  general  case  as  the  one  in  Theorem  4.2,  the  property  of  UMPI  is  still  an  open 
problem. 


Chapter  5 

TESTING  VARIOUS  OTHER  HYPOTHESIS 


In  this  chapter,  invariant  tests  for  testing  various  hypotheses  are  studied.  There  are 
five  sections  in  this  chapter.  In  each  section,  the  invariant  tests  and  their  properties  in 
different  hypothesis  testing  are  derived.  The  order  of  these  problems  are  test  for  lack  of 
correlations  among  the  sets,  test  of  equality  of  covariance  matrices  and  mean  vectors,  test 
for  known  correlation  coefficient,  test  for  known  partial  correlation  coefficient,  and  test  for 
zero  multiple  correlation  coefficient. 

5.1.  Criteria  for  testing  lack  of  correlations  among  the  sets. 

Consider  X„Xf,  ~  LECnXp(p,'E,  <j>),  with  E  >  0,  and  n  >  p.  If  the  density  of  X  exists, 

I 

then  the  density  of  X  has  the  form  of 

|E|-*"  g( trE-x(X  -  l„/i')'(X  -  lnp%  (5.1) 

where  l'„  =  (1,  -  -  * ,  1). 

Suppose  we  partition 


E  = 


/  Eu 


Vs,i 


(5.2) 


such  that  Eyy,  i  =  =  1, . . . ,q  is  pt-  x  py  matrix  and  22*=iPi  =  p.  We  wish  to  test 

Ho  :  E,y  =  0,  i  ^  j  against  the  general  alternative  Hi  :  E,-y  ^  0  for  some  *  ^  j.  First  we 


have  to  find  the  sufficient  statistics.  Since 
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(X  -  l„/i')'(X  -  l„p') 

=  (X  -  lnx')'{X  -  l„xf)  +  (l„(x  -  |i),),(l»(*  -  **)') 
=  A  +  n(x-p)(x-/i)', 


(5.3) 


the  density  of  X  has  the  form  of 

|2|- *”  g{ tr  E-1[A  +  n(x  -  p)(x  -  /*)']),  (5.4) 

where  x  =  £xfl„  and  A  =  (X  —  lnx')'(X  —  lnX1).  By  the  Factorization  theorem,  A  and  x 
form  a  sufficient  set  of  statistics  for  E  and  p.  This  problem  remains  invariant  under 


(1) X  h*  X  +  l„d\ 

(2) X  •-»  XC,  (5.5) 


where  C  is  a  matrix  with  nonsingular  diagonal  blocks  of  C,-  with  order  p,-,  t  =  1, . . .  ,q,  and 
off-diagonal  blocks  of  0’s. 

Under  group  (1), 


AwA,  xHjt  +  d. 

So  the  only  invariants  of  the  sufficient  statistics  are  A  (by  letting  d  =  — x.) 

(2), 


(5.6) 


Under  group 


/A 


where  A  = 


li 


V  ^gl 

blocks  of  A i  =  1, 


CjAnCi  •••  C'Ai,<V 

VC'^Cx  •  C^C,, 


(5.7) 


:  I  and  A ,■/  is  p,-  x  pj  matrix.  Let  Ao  be  a  matrix  with  diagonal 
A  gg  J 

,q,  and  off-diagonal  blocks  of  0’s,  B  be  a  matrix  with  diagonal  blocks 
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of  B,-,  i  =  1  ,...,q,  and  off-diagonal  blocks  of  0’s  such  that  =  I,  »  =  1, . . . ,g,  then 

B'AoB  =  I.  The  invariants  of  the  sufficient  statistics  are 


/  I  BiAijBa 


B'AB  = 


B'2A21B! 


I 


AigB?  \ 

b'2a2,b. 


(5.8) 


VB^Bx  B'^B*  •••  I  J 

So  Q'B'(A  —  Ao)BQ  is  invariant  with  respect  to  group  (2),  where  Q  is  a  matrix  with 
orthogonal  diagonal  blocks  and  off-diagonal  blocks  of  0’s.  Hence  we  conclude  that  the 
invariant  tests  for  the  multivariate  elliptically  contoured  distributions  are  same  as  the  ones 
in  multivariate  normal  case. 


Some  invariant  test  criteria  are  : 


(1)  Likelihood  ratio  criterion  : 

|B'(A  -  Ao)B  +  l| . 

(5.9) 

(2)  Nagao’s  criterion  : 

^tr[B'(A-Ao)B]2. 

(5.10) 

Since  the  likelihood  ratio  criterion  is  admissible  in  the  multivariate  normal  case,  from 
Theorem  3.2  and  Theorem  3.4,  we  know  that  the  likelihood  ratio  criterion  is  also  admissible 
in  the  multivariate  elliptically  contoured  case. 

Next  we  want  to  find  the  null  and  non-null  distributions  for  the  above  criteria.  Prom 
Corollary  2.1,  we  have  the  following  corollary. 


Corollary  5.1. 

For  both  the  null  and  non-null  cases,  the  distribution  of  B'(A  — Ao)B  does  not  depend 
on  any  particular  underlying  elliptically  contoured  distribution,  i.e.  the  null  and  non-null 
distributions  are  same  as  the  ones  in  the  multivariate  normal  case. 
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From  the  above  corollary,  we  know  that  the  distributions  of  likelihood  ratio  criterion 
and  Nagao’s  criterion  are  the  same  as  the  ones  in  multivariate  normal  case.  So  their 
asymptotic  distributions  are  the  same  as  the  ones  in  multivariate  normal  case. 

5.2.  Testing  of  equality  of  covariance  matrices  and  mean 
vectors. 

Consider 


AfUC,nXp(M;Ei,  ...  ,£i,£2?  •••  >^2>  •••  •••  ,Eq;<f>) 

with  n,-  E.-’s,  E,  >  0,  n,-  >  p,  i  -  ELi  =  n>  XH‘>  3  =  l1.-  -.".-.  *  =  1, •••,?,  are 

p  x  1  vectors,  the  rows  of  M  are  n,-  (/ij)’s,  i  —  1 . .  and  the  density  of  X  exists.  Then 

the  density  of  X  has  the  following  form 

(niE.r^-),(Etr^G.),  (5.H) 

1=1  1=1 

where 


"i 

G<  =  ~  “  Pi)' 

i=i 

-A i  +  rti(xi  -  m){xi  -  Pi)', 
m 

Ai  =  “  Xj)(*v  -  XfV, 

/=i 


and 
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By  the  Factorization  theorem,  A,-,  xy,  *  =  1, . . . ,  q,  form  a  sufficient  set  of  statistics  for  E 
Hi,  i  =  1 . . . ,  q.  Now  we  wish  to  test  £f0  :  /ij  =  •  •  •  =  and  Ex  =  •  •  •  =  E9  against  the 
general  alternative  Hi  :  /it-  ^  fij  for  some  t  ^  j  or  E ,•  ^  Ey  for  some  i  ^  j. 


The  problem  remains  invariant  under 


(1) x,y  t-+ xyy  +  d,  V*,j, 

(2) D  »-*  DC*-1,  BwC'B, 

(3) x,y  h*  Cx,y,  Vt,j, 


(5.12) 


where  C*  and  C  are  nonsingular,  D  =  —  ftq,  •  ■  ■  ,ftq-i  —  pg),  and  B  is  a  (q  —  1)  x  n  matrix 

with  (i,  j)th  element  1  when  n*  <  f  <  Xlt-i  nk,  0  otherwise. 

Under  group  (1), 


A,- 1-4  A »■,  x,-  *-♦  x,-  +  d,  V*.  (5.13) 

Hence  the  invariants  of  the  sufficient  statistics  are  (Ax,  •  •  • ,  A^)  and  (xi  —  x^,  •  •  •  ,Xg_i  —  x9). 
Under  group  (2),  as  in  MANOVA,  the  invariants  of  the  sufficient  statistics  are  (Ax,  •  ■  • ,  A^) 
and 

H  =  (Xl  —  X,,  •  •  •  ,X,_1  -  Xf)E(xx  -  X,,  •  •  •  ,Xq-l  -  Xf)', 
where  E  =  (e;y)  is  a  (q  —  1)  x  (q  —  1)  matrix  with  the  element  eyy  =  5yyny  —  ^n,r»y.  Since 

A  =  ]T(xyy  -  x)  (xyy  -  x)' 

•=1  ;= 1 

« 

=  H  +  £a,, 


(5  14) 
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where  x  =  £  x»7>  t^e  invariants  of  the  sufficient  statistics  are  A,  Ai,  •  •  • ,  A,. 

Under  group  (3), 


A  CAC',  A,  CAiC',  Vi,  (5.15) 

where  C  is  nonsingular. 

Let  CAC*  =  I,  then  QCA,-C'Q',  *  =  1, . . .  ,g,  is  invariant  with  respect  to  group  (3), 
where  Q  is  an  p  x  p  orthogonal  matrix.  Therefore  we  conclude  that  the  invariant  tests  in 
multivariate  elliptically  contoured  distribution  are  same  as  the  ones  in  multivariate  normal 
case. 

Some  invariant  test  criteria  are  : 


(1)  Likelihood  ratio  criterion  : 

nt, 

|A|S" 

(2)  Bartlett  modified  likelihood  ratio  criterion  : 

re,. 

|A|$("-«> 


(5.16) 


(5.17) 


For  dealing  with  the  null  distribution  we  have  the  following  corollary  from  Theorem 

2.1. 


Corollary  5.2. 

Under  =  •  •  •  =  pq,  the  joint  distribution  of  CAiCf,  i  =  1, . . .  ,q,  does  not  depend  on 
any  particular  underlying  elliptically  contoured  distribution,  i.e.  1 1  is  same  as  the  one  in  the 
multivariate  normal  case. 

from  the  above  corollary,  the  null  distribution  of  the  invariant  test  criterion  in  the 
multivariate  elliptically  contoured  distribution  is  the  same  as  the  one  in  the  multivariate 
normal  case. 
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5.3.  Test  of  Hypothesis  about  correlation  coefficient. 


Assume  Xnxp  ~  LEC„xp(p,  E,  <f>),  with  E  >  0  and  n  >  p,  and  the  density  of  X  exists, 

(Eii  Ei2\  (  a\  p(ritT2\  . 

I ,  En  =  (  ]  is  a  2  x  2  matrix.  We  wish  to  test  H0  : 

E2i  Sm/  \P<r <r\  ) 

p  =  Po,  where  |  pa  \  <  1  is  a  given  number. 


Let  B 


a  p  x  2  matrix,  then  from  Corollary  1  of  Anderson  and  Fang 


(12X2  \ 

o)be 

(1982b),  XB  ~  L£<7„x2(/iB,En,^)  with  f  e  $2„  ~  R*  =  Rbn  in{p_2),  Pin{p_2)  ~ 

Beta(n,  |n(p  —  2)).  So  .without  loss  of  generality,  let  p  =  2.  i.e.  X„x2  ~  L£7Cnx2(#i,E,^), 

(o\  poxo-i  \ 

I  .  The 

P°\°2  <j\  ) 

density  of  X  has  the  form  of 


|E|“  y(tr  E-1G), 


(5.18) 


where 


G  =  (X  -  l„/i')'(X  -  ln/i') 


=  A  +  n(x  —  *j)(x  -  p)\ 


-X'l„, 


and 


A  =  (X  -  l„x')'(X  -  l„x') 

(Oil  012  \ 

®21  «22  / 


(5.19) 


By  the  Factorization  theorem,  A  and  x  form  a  sufficient  set  of  statistics  for  E  and  p. 
The  problem  remains  invariant  under 
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(1) X  *-*  X  +  l„d', 

(2) X  w  XC,  (5.20) 


(a  o\ 

where  C  =  I  ) ,  and  e i,  >  0. 

VO  c2J 

Under  group  (1), 

A  •-»  A,  xwx  +  d,  (5.21) 


and  the  only  invariants  of  the  sufficient  statistics  are  A  (by  letting  d  =  — x.)  Under  group 

<2)’  •  .  X 

Clflll  CiC2ai2  \ 

Vclc2fl21  c|a22  J 


(5.22) 


The  only  invariant  (i.e.  maximal  invariant)  of  the  sufficient  statistics  is  ri2  =  -fJV  (by 

letting  c\  =  Ojj"  ,  C2  =  a22J .)  We  conclude  that  the  invariant  tests  in  multivariate  elliptically 
contoured  distribution  are  same  as  the  ones  in  multivariate  normal  case. 

For  finding  the  null  and  non-null  distributions  of  n2,  from  Corollary  2.1,  we  have  the 
following  corollary. 


Corollary  5.3. 

For  both  the  null  and  non-null  eases,  the  distribution  of  rj 2  does  not  depend  on  any 
particular  underlying  multivariate  elliptically  contoured  distribution. 

EYom  the  above  corollary,  all  the  optimum  properties  of  ri2  for  multivariate  normal 
case  is  fulfilled  for  multivariate  elliptically  contoured  case,  such  as  :  asymptotic  normality, 
and  “Fisher’s  z”  test.  From  Corollary  2.2,  rj2  is  UMPI  against  alternatives  p>  p 0  . 
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5.4.  Test  of  hypothesis  about  partial  correlation  coefficient. 


Assume  X„Xp  ~  LECnxp(p,  £s  <j>),  with  E  >  0  and  n  >  p  >  q,  and  the  density  of 
/  £n  E12  \ 

X  exists,  where  E  =  I  j ,  and  En  is  a  9x9  matrix.  We  wish  to  test  Ho  : 

\  S21  S22  / 

Pi2  q+i,  -  j>  =  Po>  where  |  p0  |  <  1  is  a  given  number.  Without  loss  of  generality,  assume  that 
9  =  2  (by  the  same  technique  as  in  section  5.3.) 


Let  X  =  (Xi,  X2),  where  Xi  =  (xi,  X2)  is  n  x  2,  X2  is  n  x  (p  —  2),  and  xi,  x2  are 
n  x  1  vectors.  The  density  of  X  has  the  form  of 


|£|- *"  g(tr  E-1G),  (5.23) 

where 

G  =  (X  —  lnft'Ypi  —  In/*’) 

=  A  +  n(x  -  #i)(x  -  t*Y, 

-  - 

x  —  Xl„, 
n 

A  =  (X-l„x,)'(X-lnx') 

An  A12  \ 

A21  A22  / 

(an  ai2  \ 

]  is  a  2  x  2  matrix. 

®21  °22  / 

By  the  Factorization  theorem,  A  and  x  form  a  sufficient 


(5.24) 


set  of  statistics  for  E  and  ft. 


The  problem  remains  invariant  under 
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(1)X  h*X+  lnd', 


(2)X 


(3)X 


(4)Xi  ~  Xi 


c:> 
(::> 
c  :,)■ 


(5.25) 


where  C  is  a  (p  —  2)  x  (p  —  2)  non-singular  matrix,  B  is  a  (p  —  2)  x  2  matrix,  and  5lt  fc2  >  0. 
Under  group  (1), 

AhA,  xHjc  +  d.  (5.26) 

So  the  only  invariants  of  the  sufficient  statistics  are  A  (by  letting  d  =  — x).  Under  group 

(2), 


(  An  A12C 

VC’A21  C*A22C 


(5.27) 


Hence  the  only  invariants  of  the  sufficient  statistics  are  An,  A*2C  (by  letting  C'A2iC  =  I.) 
Under  group  (3), 


An  •-*  An  +  B,C*A2j  +  Aj2CB  +  BfD, 

Ai2C  h-  Ai2C  +  B'.  (5.28) 


The  only  invariants  of  the  sufficient  statistics  are 
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Ah-2  =  An  —  A12AJj"2l  A21 

a  u-s,~*  a  12-3,— 
a  22-3, -,p 

(by  letting  B'  =  —  Ai2C.)  Under  group  (4), 


An-2  *-* 


f  M2<*12 


\  M2a21-3,-,p  &ia22-J 

Thus  the  only  invariant  (i.e.  maximal  invariant)  of  the  sufficient  statistics  is  r 12-3, •••*  = 


Mr#  \ 

V*  / 


(5.29) 


_i 


— y - ,  (by  letting  6,*  =  a-.,  ,  i  =  1, 2.)  We  conclude  that  the  invariant  test  in 

an  s,  -,paM  i,  -,p 

multivariate  elliptically  contoured  distribution  is  same  as  the  one  in  multivariate  normal 


case. 


Next  we  want  to  find  the  null  and  non-null  distributions  of  r i2-s,-j»  for  multivariate 
elliptically  contoured  distribution. 


Corollary  5.4. 

For  both  the  null  and  non-null  cases,  the  distribution  of  ri2  S,-vp  does  not  depend  on 
the  underlying  multivariate  elliptically  contoured  distribution. 

Proof. 

Clear  from  Corollary  2.1.  I 


From  the  above  corollary  and  Theorem  4.3.5  due  to  Anderson  (1984),  the  distribution 
of  a  sample  partial  correlation  coefficient,  ri2-s,-j»  based  on  X„Xp  with  population  partial 
correlation  coefficient  p  12.3, equal  to  a  certain  value,  p,  is  the  same  as  the  distribution 
of  an  ordinary  correlation  coefficient  based  on  X(n_(p-9)]xp  with  corresponding  population 
correlation  of  p.  Also  the  asymptotic  normality  and  “Fisher’s  z”  test  are  fulfilled  for  the 
multivariate  elliptically  contoured  case.  Obviously,  #12-3,-^  is  UMPI  against  alternatives 
Pl2-3,—j>  >  PQ  ■ 
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5.5.  Test  of  hypothesis  about  the  multiple  correlation 
coefficient. 


Assume  Xnxp  ~  LECnxp(fi,E,  <f>),  with  E  >  0  and  n  >  p,  and  the  density  of  X  exists, 

/  Eh  £12  \  _ 

where  E  =  I  J ,  and  En  is  a  q  x  q  matrix.  We  wish  to  test  Ho  :  Ri-q+i,...#  =  0, 

\E2i  E22/ 

against  the  general  alternative  Hi  :  Ri-q+i-.j  >  0.  So  , without  loss  of  generality,  assume 
that  q  =  1  (by  the  same  technique  as  the  one  in  section  5.3.) 

Let  E  =  (  ^  V  then  Ri-2,-#  =  —•***-  ”  (>)^  -  =  0  iff  =  0.  The  test  is 

\<T(1)  E22  /  *n 

same  as  Ho  :  1)  =  0  against  Hi  :  <r^  £  0.  This  is  a  special  case  of  testing  for  lack  of 

correlations  among  the  sets.  Then,  by  the  result  in  section  5.1,  B^a^Ai  are  the  invariants 

of  the  sufficient  statistics,  where  B2A22B2  =  I,  fifan  =  1,  and  A  =  (  ) .  Since 


[(T*  ji 

then  Ri-2,-.  j  =  — ***— - t~ — —  =  0  iff  <rji)  =  0.  The  test  is 


the  problem  remains  invariant  under  the  group  (Q  |  Xh+  XQ,  where  Q 
Qi  is  (p  -  1)  x  (p  -  1)  orthogonal  }.  Under  this  group, 


=  (“"  “wY < 

\a(l)  A22/ 

(1  0\ 

tere  Q  =  I  J 

Vo  qJ 


B2f«(i)6i  *-♦  Q'1B2,a(i)6i.  (5.30) 

The  only  invariant  (i.e.  maximal  invariant)  of  the  sufficient  statistics  is  6i(a,(1)B2B^a(1))a  = 
=  R  (by  letting  the  first  column  of  Qi  is  B^a^).)  Also  ,by  Corollary  5.1, 
we  claim  that  the  null  and  non-null  distributions  of  R  in  multivariate  elliptically  contoured 
distribution  are  same  as  the  ones  in  multivariate  normal  case.  So  R  is  UMPI.  Also,  the 
invariant  test  is  same  as  the  one  in  multivariate  normal  case. 


Chapter  6 


MAXIMUM  LIKELIHOOD  ESTIMATES  AND 
LIKELIHOOD  RATIO  TESTS 


In  this  chapter  maximum  likelihood  estimates  (MLE)  and  likelihood  ratio  tests  (LRT) 
for  the  multivariate  elliptically  contoured  distribution  are  derived.  We  first  give  the  result 
for  a  special  case,  then  some  examples,  and  finally  the  general  theorem 

6.1.  The  result  for  a  special  case. 

Anderson  and  Fang  (1982c)  attacked  the  problems  of  MLE  and  LRT  and  gave  some 
conditions  for  finding  MLE.  In  this  section,  more  precise  and  weaker  conditions  are  given. 
The  following  lemma  contains  the  general  idea  for  constructing  the  weaker  conditions  in 
finding  MLE. 

Lemma  6.1. 

Assume  that  g(-)  it  a  continuous  function  such  that  g(x2  H - 1-  Xy)  it  the  density  of 

EC?f(0, !,</>)  and  E(R2)  <  oo  where  R  «-♦  <f>  €  Then  the  function 

h(x)  =  x^g(x),  x  >  0,  (6.1) 

has  a  maximum  at  some  finite  xq  >  0. 

Proof. 

Since,  from  Cambanis,  Huang  and  Simons  (1981)  (or  Lemma  13.3.1  of  Anderson 
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(1984)),  the  density  function  of  R 2  is 

r(^) 

M*)  =  — 

T  a 


(6.2) 


From  E(R2)  <  oo,  lim,_oo  h(x)  =  0.  Furthermore  h( 0)  =  0.  (If  not,  there  exists  e  >  0, 
xf(x)  >  e  Vx  >  6,  so  1  >  /g  f(x)  dx  >  f£  -  dx  =  oo  which  is  a  contradiction.)  Hence 
h(x)  >  0  Vx  >  0,  and  h()  is  continuous.  The  assertion  of  the  lemma  follows.  | 

The  following  theorem  is  the  main  theorem  in  this  section  and  let  us  define  the  as¬ 
sumption  for  this  theorem. 

Assumption  A. 


X  ~  M£K7nXp(M;sW,  ...  ,EW,  ...  ...  ,eM;^)  with  n,-  E^  ’a,  n{  >  p, 

i  =  1  and  Si=i n«  =  n-  ^/ie  rows  o/M  are  n*  pS1)  ’s,  ...,  nq  ’s,  successively, 

where  E^  =  diag(E['j , . . . ,  E**),  1  <  k  <  p.  The  density  function  ofX  is 


Theorem  6.1. 


Under  the  assumption  A,  if  /(A)  = 


A"i 


np9(\)  attains  its  maximum  at  some  finite 
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Xmax(g),  then  the  MLEs  of  pW  and  E^  are 

=  xW, 

E(0  =  diag(ES?,...,E$), 


(6.4) 


*  =  where  Eyy  =  hmvt{g)wfj ,  i  =  j  =  is  obtained 

from  by  substituting  /|W  withx^  =  jjr  ]£yiflf_1+1X(y)  an^the  corresponding  maximized 
likelihood  is 


(wrt)  5  [(^’^nKr5”' 


(6.5) 


Proof. 


max  L(|i(l\. . .  ,#i(?);E(l), . . .  ,E(,)) 
yi<0,E(,) 

=  max L(p{1) p{q) ; E(1) , . . . , E(,) ) , 
/*(•)  £(■) 


(6.6) 


where 


L=L(/|W,...,^;E<1),...,E<«)) 


nnN?ri"' 

»=i  i=i 


(££>«)  • 


(6.7) 


First  we  want  to  maximize  L  with  respect  to  E^,  *  =  1, . . .  ,q.  Since  G j-y  >  0,  *  =  1, . . . ,  q, 


y=l,...,Jb,  with  probability  one,  there  exist  nonsingular  matrices  C,-y  such  that  CyyCj-y  = 
Gyy  and  orthogonal  matrices  r,y  such  that 


rl/°?5:5yci,"r«  =i<i 


=  diag(AjJ,...,Aj.J  ), 
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with  positive  Ay’2»  a  =  1, . . .  ,py.  Therefore 


L  = 

9k/  Pi  \  a”**]  (  i  k  Pi  \ 

IIII  N?|IIM2  »  EEE‘/M2  • 

V  £»=1  /  J  \^i'=l  ;=1  o=l  J 

(6.8) 

It  is  a  symmetric  function  of  A J2  (j  =  1, . • . , k,  a  =  1, . . .  ,py)  for  %  =  1, . . 
Ay0  =  j  I j •  •  •  i Aj  ft  1, jPyj  and 

. ,  q.  Hence 

L  = 

(6.9) 

Let 

-  =  F  — 

A  4^  A(')’ 

1=1 

(6.10) 

w’  y  =  i,.--, g-i- 

So  maximizing  L  with  respect  to  i  =  1, . . . , q,  is  the  same  as  maximixing  L  with  respect 
to  A,  ft,  j  =  -  1. 


Now 


So 


max  L 
A,0. 

=nnN?f’"‘ 

•'=»  ;=i 


(6.11) 


(6.12) 
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L  attains  its  maximum  at  A  =  A max(ff)>  ft  =  7*,  *  =  1, . . . ,  g  —  1  and 


L*  =  max  L 

S(i) 


—  Ama*(p) 


(6.13) 


Since  G,-  =  W,-  +  n,-(x^  -  #»W)(xM  —  /|W)',  maximizing  L*  with  respect  to  j|W,  *  =  1, . . .  ,g, 
shows  that  L  attains  its  maximum  at  #|W  =  xW.  This  completes  the  proof.  I 

Combining  the  above  lemma  and  theorem,  we  have  the  following  corollary. 

Corollary  6.1. 

Under  the  assumption  A,  if  y(-)  is  continuous  and  E(R 2)  <  00  where  R  *-*■  <t>  €  $n p, 
then  the  MLEs  of  pW  and  are 

*W  =  *«, 

E(<)  =  diag(ES?,...,S«), 

*  =  1, . . . ,  q,  where  sjy  =  (“)Amai(^)  wjy  ,  t  =  1, . . . ,  q,  j  =  1, . . . ,  k,  and  the  corresponding 
maximum  of  the  likelihood  function  is 


(6.14) 


(6.15) 


Under  some  conditions,  Theorem  6.1  and  Corollary  6.1  show  that  the  maximum  of  the 
likelihood  function  for  the  multivariate  elliptically  contoured  distribution  is  proportional  to 
that  of  the  multivariate  normal  distribution.  So  the  following  likelihood  ratio  criteria  in 
the  multivariate  elliptically  contoured  distribution  are  the  same  as  those  in  the  multivariate 
normal  case: 

(1)  The  criterion  for  testing  for  lack  of  correlation  among  the  sets  (g  =  1): 

|W| 


n  = 


nu  iw»i 


(6  16) 


Chapter  6:  Maximum  likelihood  estimate i  and  likelihood  ratio  tests  49 


(2)  The  criterion  for  testing  the  hypothesis  that  a  mean  vector  is  equal  to  a  given  vector  /i0 
(q  =  k  =  1): 


|wl 

|W0| 


1 


1  +  r2/(n  -  1)  ■ 


(6.17) 


where  Wo  =  (x(/)  —  #*o)(x(/)  ~  PtiY  T2  is  the  Hotelling  T2-statistic  (see  Chapter 

4.) 


(3)  The  criterion  for  testing  the  hypothesis  of  equality  of  covariance  matrices  (k  =  1): 


(4)  The  criterion  for  testing  equality  of  several  means  (k  =  1): 


(6.18) 


_  1W| 


\£UW> 

(5)  The  criterion  for  testing  equality  of  several  means  and  covariance  matrices  (Jfe  =  1): 

\p»i 


(6.19) 


-na!"‘£r' 


(6.20) 


(6)The  criterion  for  testing  the  hypothesis  that  a  covariance  matrices  is  proportional  to  a 
given  matrix  Eg  (q  =  k  =  1): 


r«  = 


iyw| 

(tr  Sq1W/p)p 


(6.21) 
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6.2.  Examples. 

Section  6.1  gives  six  examples  for  finding  LRT.  In  this  section,  three  examples  that 
do  not  fulfill  the  assumptions  in  Theorem  6.1  are  given.  From  the  results  of  these  three 
examples,  we  get  the  idea  of  general  theorem  in  section  6.3.  The  following  three  examples 
satisfy  the  assumption  that  /(•)  attains  its  maximum  at  Xxazx(g). 


Example  1.  Tests  of  hypotheses  concerning  subvectors  of  p. 


(CH** 


If  X„xp  ~  LEC„Xp  ((  |  ,  E,  <t>  j  with  E  >  0  and  n  >  p.  We  wish  to  test  Ho 

p1  =  0.  The  likelihood  function  has  the  following  form  : 


I(#i1,/i2,E)  =  |E|-l"y(trE-1G), 


-(*<)}(*<)} 


where  G 

Let  flj  =  {(#i,E)  |  E  >  0}  and  u  =  {(/i,E)  |  px  =  0,  E  >  0}.  It  is  clear  that 


max  £(#<!, |i2,E)  =  \m*x(g) 


where  W  is  obtained  from  G  by  substituting  ft,-  with  x,-  i  —  1, 2.  Since 


(6.22) 


(623» 


maxL(/ilt/i2,E)  =  maxmaxL(0,/i2,E), 


using  a  similar  argument  as  the  proof  of  Theorem  6.1,  we  get 


max  L(0, /i2,  E)  —  Amaz(^) 
£ 


(6.24) 


i"',(jdw)|Hri"'  (625) 


where 
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Thus 


and 


j(,-j 

(i  wr11w12\ 

(xi,  (x2  -  p2)')  =  (xi,  (x2  -  p2y  -  xi  WIi1  W12)  (  I  J  ,  (6.26) 


^1  W^Wnj"1  _^I  -W-'Wj,  j 


(  1  °) 

(  Wu 

w12\ 

/i  -w-/w13\  /Wn  0  \ 

V  -w21w-‘  l) 

Vw2i 

w22y 

\0  I  )\o  W22.j  J 

(6.27) 


imply  that 


|H|  = 


W  + 


•(.**  )(.*  )] 


=  |W|  l+n(x,1,(x2-/i2)')W-1(  Xl  )| 

\**-#*2/ J 


=  |W| 


/w-1 


11 


1  +  n(xi,  (x2  -  tt2y  -  xiw-xw12) 

V  o  w; 
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(  *’■  )1 


=  |W|  [1  +  n^'W^Xi  +  [(x2  -  /i2)  -W21W-1x1]'W^11 


(6.29) 


x  [(x2  -  p2)  -  WjiW^xx]}] , 


/Wu  W12\ 

\w2i  w2J 


where  W  =  (  )  and  W22.x  =  WM  —  W21W^jW12.  The  maximum  likelihood 

.W21  W22. 
estimate  p2  of  /i2  is 


P2  =  x2  -  W2iWf1ixi„ 


(6.30) 


So 


maxLfm^Z)  =  Amax(?)-Wff  (*  +  ^iWuXi)">n,  (6-31) 


and  the  likelihood  ratio  criterion  is 

maX(j  L(/i1,^2,S) 


|w|i" 


maxo  L(pi,p2,'E)  |W|i"  (i  +  rax*  W^x^i" 

=  ( _ i _ V". 

Vl+nxiW^xj/ 


(6.32) 


Example  2. 

If  Xnxp  ~  LECnxp[p,E,<l>)  with  E  >  0  and  characteristic  roots  A,-,  i  =  1, . .  ,p,  with 
Ai  >  A2  >  •  •  •  >  Ap  >  0.  We  wish  to  test  Ho  :  A,-+i  =  •  •  •  =  A,-+*,  t  +  k  <  p.  Since  E  >  0, 
there  exists  an  orthogonal  matrix  T  such  that  E  =  FAT'  where  A  is  an  diagonal  matrix  with 
the  (t,  £)th  elements  A{,  t  =  1, . . .  ,p  and  the  likelihood  function  has  the  form 
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where  H  =  r'GI\ 


Let  fi  =  {(it,  E)  |  E  >  0}  and  w  =  {(p,  E)  |  A,-+1  =  •  •  •  =  A,-+fc}.  It  is  clear  that 


maxL(/j,E)  =  Amax(^)"inpy 


(6.34) 


Since 


max  L(/i,E)  =  max  L(p,  r,A),  (6.35) 

u  p) Teo„ 

A€M(p>|>k) 


where  Op  is  the  class  of  p-dimensional  orthogonal  matrix,  and  At(p^jt)  =  {A  |  0  <  Ap  < 
•  •  •  <  A,-+t+1  <  Al+t  =  •  •  •  =  A*  +  1  <  A;  <  •  •  •  <  A!},  by  Theorem  6.1, 


max  L(p,  T,A)  =  Amax(y)~inp^ 
A€.M(p,|,k) 


n  vgv 

i+k+l<j<p 


-\n 


> 


(6.36) 

where  H  =  (ft,/)  =  (t[G 7/),  T  =  (71 ,  •  •  •  ,7P),  and  hpp  <  ftp_i  p_i  <■••  <  ftn.  Therefore 


G  =  W  +  n(x  -  #i)(x  -  p)\ 


(6.37) 
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hjj  =  'fjG'ij 

=  7y(  W  +  n(x  -  fi)(x  -  #i)'by 

=  jj'W'lj  +  -  /*)(x  -  ti)\-  (6.38) 

>  7/W7/, 


and 


maxAmax(j;)“5n,’( 


-in 


imply  that 


—  Aihm(j) 


[(*£**»*«)  JL^4"V 

v  7  *s/s» 


(6.39) 


Let  F  =  r'WT  =  (^jG^y)  =  (fij),  fpp<  •••  <  /n,  and  0  <  rp  <  •••  <  n  be  the 
characteristic  roots  of  W.  Then,  by  Theorem  9.B.1  due  to  Marshall  and  Olkin  (1979), 
f  -<  r  (r  majorizes  f)  on  Dp  =  {  x  G  9ip  :  Xi  >  •••  >  xp},  where  f  =  (fn,"  • ,  fpp)  and 
r  =  (ri,-*-,rp).  Define 


(6.40) 


where  x  6  3CP.  Then  i>  is  continuous  on  Dp  and  continuously  differentiable  on  the  interior 


Chapter  6:  Maximum  likelihood  estimates  and  likelihood  ratio  tests  55 


i/>l(x)  = 


dzt 


(6.41) 


is  decreasing  in  l  =  1, ...  ,p,  by  Theorem  3.A.3  due  to  Marshall  and  Olkin  (1979), 


/JS.. *“«■*''* (dsi)  [(*£*"*")  JL 

L  v  '  •<><» 


—  hjnxxis) 


,^max(j), 

Thus  the  likelihood  ratio  criterion  is 
maxyi^E) 


n  ■» 


■+*+•<><? 

!<><i 


nun 


-*n 


fa 


~sn 


maxn  L(p,  S)  I  (iy*k  _  \*n 

\  y  i  tsj=l  r«+j  j  II  •+*+>s>Si»  rj 


(6.42) 


(  n{.,rW  V 
U*  £**•*«); 


(6.43) 
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Example  S.  Circularly  symmetric  model 

If  Xnxpft  ~  LECnxpkip,  E,  <f>)  with  E  >  0  and  n  >  pk.  Consider  the  following  four 
hypotheses  : 

Hi  :  E  =  diag(Eo,*",Eo), 


<Eo  Ei  •••  Ei  Ei  > 

Si  Eo  •••  Ei  Ei 

Ei  Ei  *  *  *  Eo  Ei 

VEi  Ei  *•*  Ei  Eq; 


/  Eo  Ei  •  •  Ep_2  Ep_i  \ 
I  Ep_i  Eo  Ep_s  Ep_2  I 


Hs  :  E  = 


E2  Es  •••  E0 

V  Ei  E2  •  •  •  Ep_i 

Ht  :  E  >  0, 


E‘ 

Eo  J 


where  Ey,  i  =  0, 1, . . .  ,p  —  1  are  k  x  k  symmetric  and  E  is  pk  x  pk  positive  definite. 

If  E  is  circularly  symmetric,  then 

E  =  (W0  ®  E0)  +  (Wj  ®  Ei)  +  •  •  •  +  (Wp_i  ®  Ep_i),  (6.44) 

lP-i\ 

=  I,  Wy  =  j  ),  j  =  1, ...p  —  1,  and  Wy  =  WjJ .  From  the  theorem  in 

Viy  0  J 


where  W0  = 

Olkin  (1972),  there  exists  T  €  Op,  with  elements  ry*  given  by 

.(y-i)(*-i)\  _ _ x  (j  -!)(*-!)' 


rih 


=  p  2  |sin  ^2t- 


i)+c<4«i>)}, 


(645) 


such  that  (r®I)E(r'®l)  =  diag(#i,  •  •  • ,  9p)  =  D*  where  9j,  j  =  1, .. .  ,p,  are  k  x  Jb  positive 
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definite.  The  parameter  space  for  each  of  the  hypotheses  H\-3i  becomes 

wi  =  {D*  |  #1  =  •  •  •  =  #p  >  0,  given  */  =  *p-/+2,  j  =  2,  •  •  •  ,p}, 

u2  =  {D*  |  >  0,  *2  =  •••  =  *p  >  0, given*/  =  »p_/+2,  j  =  2,--,p}, 

w3  =  {D*  I  *i  >  0,  */  =  *p_/+2  >  o,  j  =  2,  •  •  •  ,p}, 

u/4  =  {E  |  E  >  0}. 

Let  V  =  (r  ®  I)S(r'  ®  I)  =  (V,/),  i  =  1, . ..  ,p,  j  =  1 . . 

When  p  =  2m, 

(Vi, •  •  • , Vm+1)  =  (Vn, V22  +  Vpp,-  -,Vmm  + Vm+2  m+2,Vm+1  OT+i),  (6.46) 

i 

while  p  =  2m  +  1, 

(Vi,---,Vm+1)  =  (Vu,V22  +  Vpp,---,Vm+i  m+i  +  Vm+2  m+2),  (6.47) 

where  Vy  =  Vp_/+2,  j  =  2, . . .  ,p,  and  S  =  (X  -  l„x')'(X  -  l„x'). 

Under  u>lt  w2,  w3)  the  likelihood  function  has  the  form 

p 

£(p,  *x,  ,*p)  =  n  I**'!"*"  ?(tr  D*'(r  ®  i)c(r'  ®  i)),  (6.48) 

»=i 

using  a  similar  argument  as  the  proof  of  Theorem  6.1, 


(6.49) 
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max 

W» 


(j^y)  IVnr^l^Ev, 


-in(p-l) 


(6.50) 


max £(<., f[  rv.r*" ,  (6.S1) 


(jjj^.)  |V|-S” ,  (6.52) 

and  the  likelihood  ratio  criterion  is  same  as  the  one  in  the  multivariate  elliptically  contoured 
case. 


6.3.  General  theorem. 

From  three  examples  in  section  6.2,  we  can  derive  the  general  theorem  for  finding 
MLE.  First,  let’s  generate  the  assumption  for  this  general  theorem. 

Assumption  B. 

Let  xpfxi  have  the  density  |S|_*  </((x  —  p)'E-1(x  —  ft))  for  EjvxJV  positive  definite  and 
Ppfxi .  Let  <jj o  =  wm  x  wt,  where  uim  C  $tN  and  ute  is  a  subset  of  positive  definite  matrices 
such  that  if  E  €  we,  then  aE  €  we  for  every  a  >  0:  Suppose  that  the  MLE’s  under  normality, 
p  €  uim  and  E  G  uie,  exist,  are  unique.  E  is  positive  definite  (with  probability  1),  and  let  the 
maximum  of  likelihood  function  be  L. 

Theorem  6.2. 

Under  the  assumption  B,  if  /(A)  =  A ~*Ng  ( j)  atttains  its  maximum  at  some  finite 
then  p  =  p  and  E  =  iVAm..(a)E  are  maximum  likelihood  estimates  for  g(  )  and 
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the  corresponding  maximum  of  likelihood  function  is 


e'*NL. 


(6.53) 


Proof. 

By  definition,  the  likelihood  function  has  the  form 

L  =  L{p, E)  =  |E|“*  y(trE-1G),  (6.54) 


where  G  =  (x  -  /i)(x  -  fi)'.  Since  E  >  0,  there  exists  an  orthogonal  matrix  T  such  that 
E  =  TAT',  where  A  is  a  diagonal  matrix  with  diagonal  elements  A,-,  i  —  1  and 

Ai  >  A2  >  •  •  •  >  Ajv  >  0  are  the  characteristic  roots  of  E.  Therefore 


where  H  =  TGr'  =  (hy). 


let  i  =  ZlLx  K^ii,  Pi  =  3  =  1,  •  •  • ,  N  -  1,  then 


(6.55) 


(6.56) 


Define  =  {(r,/?!,- •  •  ,0n-i)  :  E  G  ue}.  Let  E*  =  aE,  a  >  0,  then  A?  =  aA,-, 
*  =  1, . . . ,  N  and  A*  =  aA.  Therefore  the  range  of  A  is  (0,  co)  and  wj  does  not  depend  on 


60  Section  6.3:  General  theorem 


X  (defined  asw*).  So 
max  L(/i,E) 


max 
PCu>m 

(T.#t.~4v-i>c»' 


(6.57) 


-In. 


A max  (^ )  * 


(=i»)r  K  [«(«'']  HPF‘ 


Since  max  /i€Um  ^n^i1  (^)  *  j  ^  does  not  depend  on  g,  and  un¬ 

der  normality,  g(x)  =  (2x)-*jve~i*  and  Am,*^)  =  jj,  it  »  clearly  that  p  =  p,  E  = 
NXm!LX(g)ib,  and  the  corresponding  maximum  of  likelihood  function  is 


Am»x(g)  aJVg(wp) 

JVr^Zx)-*^ 

I 


e\NL. 


(6.58) 


Combining  the  above  theorem  and  Lemma  6.1,  we  have  the  following  corollary. 

Corollary  6.2. 

Under  the  same  assumptions  as  Theorem  6.2,  if  g(-)  is  continuous  and  E(R2)  <  00 
where  R  «-*■  <f>  G  $n,  then  p  =  p,  E  =  iVAmax^E,  and  the  corresponding  maximum  of 
likelihood  function  is  (6.58) 

Corollary  6.S. 

Let  p  €  nm  C  and  E  S  Qe,  satisfying  the  conditions  of  Theorem  6.2,  and  let  the 
null  hypothesis  H  be  p  G  wm  C  flm  and  E  €  we  C  (le.  Then  the  likelihood  ratio  criterion  is 
independent  of  <f>. 
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Proof. 

The  LRC  is 


from  Theorem  6.2,  which  does  not  depend  on  R  (i.e.,  f>.) 


(6.59) 


I 


Remarks  : 


(1)  For  every  random  matrix  X  distributed  as  multivariate  elliptically  contoured  distri¬ 
bution,  we  can  write  it  in  a  vector  form  which  has  an  elliptically  contoured  distribution  and 
a  density  function  same  as  the  one  in  Theorem  6.2.  Hence  Theorem  6.2  is  more  general 
than  Theorem  6.1.  and  the  three  examples  in  section  2  also  follow  from  Theorem  6.2. 

(2)  Since,  from  (1.6), 


Gov(x)  =  E(R2  tfu{N)u{N)'n) 

=  £J(R2)D'Cov(uW)D, 


(6.60) 


E(R2)  <  co  iff  each  element  of  Cov(x)  is  finite.  So  the  assumptions  in  Corollary  6.2  —  R 
has  finite  second  moment  and  </(•)  is  continuous  —  are  quite  reasonable  assumptions. 
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NONCENTRAL  DISTRIBUTIONS  OF 
QUADRATIC  FORMS 


(XU)\ 
1 

xWJ 

fSJ  Nn  .  Anderson  and  Fang  (1982a)  derive  the  distribution  of  where 

/x'1*  \  / / i*i\  \ 

x  =  I  1  ~  EC„  II  1,1  ;  <f>  J ,  under  the  condition  that  m  —  0.  In  this  chapter, 
we  derive  the  distribution  of  x^  x^  when  /  0. 

If  x  =  (  ]  ~  ECn  (  (  1  )  ,  I  ;  <f>  ] ,  where  xW  is  i-dimensional  vector,  then 

V*|2v  VW  / 

(u  \ 

\  ,  I  ,i  and  x^  —  ft1  +  J2U<n). 

u r) 

From  Lemma  2  of  Cambanis,  Huang  and  Simons  (1981),  u["^  —  where  Rt„  (>  0) 

and  UM  are  independent  and  R\n  ~  Beta(|,  2y^).  Thus  x^  —  nx  +  KR&nuW  and  the 
distribution  of  x^  xW  is 


Prob{x^*xW  <  at}  =  Prob{(f2fZ*„uM  +  #i1)'(J£Rfa,U^  +f*i)  <  x} 
=  Prob {R2R2kn  +  2RRknV^'nl  +  IM2  <  x} 

=  Prob{V  +  2V*U^Vi  +  Ikll2  <  x}, 


(7.1) 


where  V  =  R2R\n. 


where  = 


By  making  an  orthogonal  transformation,  we  get  —  ui!!a*i!!* 

.  Since  we  know  the  distribution  of  V  and  «i  (see  Anderson  and  Fang 
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(1982a)  ), 


Probfc^x^  <  x}  —  Prob{V  +  2V*i«i +  ||^il|2  <  x} 


=  /°°  Prob  jui  <  Z  ~  ^ll|a  ~  ”  1  g{»)dv 

Jo  \  J.i|M  J 


oo  r  r*  r(|) 


=f[L 


iffrU  -  u2)^1-1/{i«i<i} du 


-oo  T(^)xi 


g{v)  dv, 


where  b  —  — -  , 


•M  '  1.1  -  •)'**-* 


and  R  ~  F. 

For  x  >  0  and  v  >  0, 

2«  *i!^i  II 

o  *  <  (w»  +  |M|)2 

<*X3  -  U/iill  <  »a 

(*»  “  IM)2  ^  »  or  a:  <  ItaH2, 


(7.2) 


(7.3) 
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and 


g-  Ik II2-  v 

2v*IM 


>  -1 


**>(«>*- Ml)2 


o  x*  +  ll/iJI  >  y*  and  y*  >  ||/<1||  - 


(7.4) 


-«•  + II^H)2  >  v  and  (y  >  (H/iill  -  **)2  or  x>||/ij2). 


Let  A  —  { 


HIM*- 

2ci|l/lj|| 


<  1},  B  =  >  -1},  «  {(»*  -  IM)2  <  »}, 


Bi-{x<  H/iill2},  and  Cx  -  {(a*  +  H#«1||)2  >  y}.  Then 


*n*-*rv*- 


We  have,  from  (7.2), 


Frob{x(1)'x(1>  <  x)  =  /  /  — ^ffr(1  “  «2)*,,_l(^r)B  +*A')du 

Jo  I/-00  1  1 


^(y)  dv 


/•<•*+ iip, id*  /■»  rd)  ,  ^  , 

=  /  /  —j~  A(l~«2)  3  ~X  dug(v)  dv 

•M-M*  ^-ir(4=l)jrS 


(7.5) 


+  /{*>IIM*}  /  tf(y)dy. 

JO 
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So,  from  Leibniz’s  rule  for  differentiating  an  integral,  the  density  of  is 

.  (2^  1K11)-1  i(l  -  dv 

r(*f±)jn 


£ 


-L 


(*i+i^‘ii)’  r(|)  A 


(»*-IM)a  r(*f±)jr* 


fffr(2^lkll)-(*-2){[v  -  (xi  - |* ||)2] 


*  ((**  +  !MI)2-»1}**,5(‘>W‘> 


(7.6) 


-L 


(**+l!^ill)*  i 

r(*=* )r( |)(2|K  ||)*-22i* 


x  {[v  -  (xi  -  |K|!)2l((xi  +  |*|)»  -  t,]}HVH(*0  dv, 


where 


,,  x  r(|)2i*  ,  x 
AH  =  -rrr-pM- 


Let  d  =  x  +  ||/*i|]2,  c  =  2x* jj^jj  and  to  =  2y^.  The  density  of xl^xW  is 
c*— 2e~ 


r(4fi)r(|)(2||^||)‘ 


- tt  f  (1  — to2)  ** e~*ewh(cw  +  d)dw 

-22  3*  /-i 


z|-le-l(g+|^1H3)^ 

r(sfi)r(|)2i*  ’ 


(7.7) 


where  I*  =  /.^(l  —  to2)*a*e-»eil,A(eto  +  d)  dto.  And 


r  =  Ai-to2)^5 

Jo 

=  f(  l-to2)^ 

Jo 


e~*ewh(cw  +  d)dw  +  f  (1— to2)  a* 

Jo 


|e-ae*h(cto  +  d)  +  eae*A(— eto  +  d)| 


eiew  h(—cw  +  d)  dto 


dto. 


(7.8) 
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Now  we  make  a  transformation  w  = 

r  =  j\  1  -  a)*?  +  d)  +  c*eti h{-ea»  +  rf)J  da 


a  *  da 


-i  I>-'v 


£ 

y=o 


tb**)’ 


h(ca*  +  d)  +  y; 

y=o 


(M)y 


A(— es*  +  d) 


A(c«*  +  d)  +  A(-c*»  +  rf' 


(M)* 


^22ij!r(i+i) 


_1 
«  * 


+  [a(-c*»  +  d)  -  A(e*»  +  d) j 


«  y  (^C,l)2i  ) 


da 


=  —  /(l- a)-*4 »  «  {  |a(c»»  +  «f)  +  A(-es»  +  rf)j  I_i 

+  |a(-c*»  +  d)  -  h(ea *  +  rf)J  ii 

_  J  (i  _  ,)Ai1  |  +  <f)  +  h{-ca*  +  d)]  J_i 


+  +  d)  —  h(ea *  +  rf)j  Ji  j  da, 
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where  Im  is  the  modified  Bessel  function  of  the  first  kind  and  order  m. 
Hence  the  density  ofxWxW  is 

jy  _  <)4iV!  { +  +  + 

x  I_1  ^a:a  +  |h(-ea»  +  d)  -  h(cas  +  d)j  Ii  ^a||/i1||«sj  j  da, 


(7.10) 


where 


*M  =  -rjir^sM, 

(M*  V  a' 

d  =  x+IKII2,  c  =  2x*||/a1||,  and  Im  is  the  modified  Bessel  function  of  the  first  kind  and 
order  m. 

Two  examples  are  given  in  the  following  : 


Example  1.  Multivariate  normal  distribution. 
rW 


If  x  =  ^  (2)  J  ~  ^  (  J  J  **  *‘t^men8*ona*  vector-  When  px  =  0,  the 

density  of  xOl'xO)  is 


g{v)  -  — t~  l.v»t~1e~«e 

r(f)2i* 


If  px  /  0,  then,  from  (7.10)  ( h(v )  =  1),  the  density  of  xW*xW  is 

2 - ^)2^i - '  4l-l  da 


T('- 


=  t  (i^f)  4  (X'M) exp  +  *)]  - 

where  Jm  is  the  modified  Bessel  function  of  the  first  kind  and  order  m. 


(7.11) 
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=  (  j  has  n-dii 

\y  2/ 


Example  2.  Multivariate  t- distribution. 

Prom  Anderson  and  Fang  (1982a),  if  y„  =  (  j  has  n-dimensicnal  multivariate  t- 
distribution  with  t  degree  of  freedom,  where  y1  is  Jb-dimensional  vector  and  =  0,  then 
\y\Y\  has  F-distribution  with  degrees  of  freedom  k  and  l  and  the  density  of  y/yj  is 

_*±« 

r(|)r(|) 

If  ¥*  0>  then,  from  (7.7),  the  density  of  y/yi  is 

e*“2  r1 

— -  /  (1  —  to2)  * 


'w  “  FTImlT*  ’  (1  +  i) 


_ r(^)^  * 


r(j 


r(|)(i  +  ^)  1 


(lw,+'+0  ’ 


-  Si !/>-'>*  [(- + (-*+'♦«)■*] 


z>  !r(*±i)/j  ^  r(^  +  2j)  . »-» /  i\2^'  _ i  , 

ii¥if(k)Sww(J+<)  •  V1-*’  H  *  ■* 


=  A  r(^  +  ;»r(})  y_„  ,.r(/+ j)r(tf>) 

r(S=*)r(i)r(|)^r(^)2«,!r(,  +  i)1  r(§+,) 

r(l)  ^j!(*+M2+<)“+2'r(f+,)' 


(7.12) 
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